Dataset statistics
| Number of variables | 18 |
|---|---|
| Number of observations | 92711 |
| Missing cells | 5 |
| Missing cells (%) | < 0.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 13.4 MiB |
| Average record size in memory | 152.0 B |
Variable types
| Numeric | 8 |
|---|---|
| Categorical | 10 |
Variable descriptions
| edad | Age of the customer. |
|---|---|
| facturacion | Amount the customer pays per month. |
| antiguedad | Date the customer signed up. |
| provincia | Province of the customer. |
| num_lineas | Number of mobile lines contracted. |
| num_lineas_impago | Number of lines with unpaid bills. |
| incidencia | SI = the customer has had an incident or complaint. |
| conexion | Type of the customer's internet connection. |
| vel_conexion | Internet connection speed. |
| TV | Type of TV package contracted by the customer. |
| num_llamad_ent | Number of incoming calls across all the customer's lines. |
| num_llamad_sal | Number of outgoing calls across all the customer's lines. |
| mb_datos | MB of data consumed across all the customer's lines. |
| seg_llamad_ent | Seconds spent on incoming calls. |
| seg_llamad_sal | Seconds spent on outgoing calls. |
| financiacion | SI = the customer is financing a handset. |
| imp_financ | Monthly amount the customer pays for financed handsets. |
| descuentos | SI = the customer has an active discount. |
Alerts
| antiguedad has a high cardinality: 92237 distinct values | High cardinality |
|---|---|
| conexion is highly correlated with vel_conexion | High correlation |
| vel_conexion is highly correlated with conexion | High correlation |
| antiguedad is uniformly distributed | Uniform |
| facturacion has unique values | Unique |
| num_llamad_sal has 936 (1.0%) zeros | Zeros |
| imp_financ has 86045 (92.8%) zeros | Zeros |
Reproduction
| Analysis started | 2022-05-04 15:40:28.458511 |
|---|---|
| Analysis finished | 2022-05-04 15:41:04.254840 |
| Duration | 35.8 seconds |
| Software version | pandas-profiling v3.1.0 |
| Download configuration | config.json |
edad
Real number (ℝ≥0)
Age of the customer.
| Distinct | 68 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 51.42923709 |
| Minimum | 18 |
|---|---|
| Maximum | 85 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 18 |
|---|---|
| 5-th percentile | 21 |
| Q1 | 34 |
| median | 51 |
| Q3 | 68 |
| 95-th percentile | 82 |
| Maximum | 85 |
| Range | 67 |
| Interquartile range (IQR) | 34 |
Descriptive statistics
| Standard deviation | 19.58591326 |
|---|---|
| Coefficient of variation (CV) | 0.380832273 |
| Kurtosis | -1.198694328 |
| Mean | 51.42923709 |
| Median Absolute Deviation (MAD) | 17 |
| Skewness | 0.001051985051 |
| Sum | 4768056 |
| Variance | 383.6079982 |
| Monotonicity | Not monotonic |
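Several figures in the edad tables follow from one another, which gives a quick way to sanity-check the report. The sketch below is an illustration and not part of the pandas-profiling output: it copies the reported values by hand (the variable names are placeholders chosen here) and recomputes the range, IQR, coefficient of variation, variance, and sum.

```python
import math

# Values copied by hand from the edad tables and the dataset statistics.
n_obs = 92711
mean = 51.42923709
std = 19.58591326
q1, q3 = 34, 68
minimum, maximum = 18, 85

derived = {
    "range": maximum - minimum,   # Maximum - Minimum
    "iqr": q3 - q1,               # Q3 - Q1
    "cv": std / mean,             # standard deviation / mean
    "variance": std ** 2,         # square of the standard deviation
    "sum": mean * n_obs,          # mean * number of observations
}

# Figures as printed in the report, for comparison.
reported = {
    "range": 67,
    "iqr": 34,
    "cv": 0.380832273,
    "variance": 383.6079982,
    "sum": 4768056,
}

for name, value in derived.items():
    ok = math.isclose(value, reported[name], rel_tol=1e-6)
    print(f"{name}: derived={value:.6f} reported={reported[name]} match={ok}")
```

The same relationships should hold for the other numeric variables, up to the rounding of the printed values.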
| Value | Count | Frequency (%) |
| 37 | 1461 | 1.6% |
| 60 | 1431 | 1.5% |
| 72 | 1426 | 1.5% |
| 53 | 1418 | 1.5% |
| 20 | 1414 | 1.5% |
| 47 | 1404 | 1.5% |
| 24 | 1404 | 1.5% |
| 50 | 1404 | 1.5% |
| 27 | 1402 | 1.5% |
| 32 | 1402 | 1.5% |
| Other values (58) | 78545 |
| Value | Count | Frequency (%) |
| 18 | 1380 | |
| 19 | 1302 | |
| 20 | 1414 | |
| 21 | 1368 | |
| 22 | 1327 |
| Value | Count | Frequency (%) |
| 85 | 1319 | |
| 84 | 1289 | |
| 83 | 1329 | |
| 82 | 1346 | |
| 81 | 1379 |
facturacion
Real number (ℝ≥0)
Amount the customer pays per month.
| Distinct | 92711 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 207.4886998 |
| Minimum | 15.00043941 |
|---|---|
| Maximum | 399.9984328 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 15.00043941 |
|---|---|
| 5-th percentile | 34.47479163 |
| Q1 | 111.3683852 |
| median | 207.0893664 |
| Q3 | 304.349361 |
| 95-th percentile | 380.6885379 |
| Maximum | 399.9984328 |
| Range | 384.9979934 |
| Interquartile range (IQR) | 192.9809758 |
Descriptive statistics
| Standard deviation | 111.2394756 |
|---|---|
| Coefficient of variation (CV) | 0.5361230549 |
| Kurtosis | -1.20486072 |
| Mean | 207.4886998 |
| Median Absolute Deviation (MAD) | 96.48298181 |
| Skewness | 0.004058335396 |
| Sum | 19236484.85 |
| Variance | 12374.22094 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 216.0281089 | 1 | < 0.1% |
| 361.0878322 | 1 | < 0.1% |
| 244.1644781 | 1 | < 0.1% |
| 178.4619084 | 1 | < 0.1% |
| 222.5037351 | 1 | < 0.1% |
| 328.7416895 | 1 | < 0.1% |
| 359.3658789 | 1 | < 0.1% |
| 260.9736906 | 1 | < 0.1% |
| 238.8083378 | 1 | < 0.1% |
| 133.0520124 | 1 | < 0.1% |
| Other values (92701) | 92701 |
| Value | Count | Frequency (%) |
| 15.00043941 | 1 | |
| 15.00449739 | 1 | |
| 15.01707741 | 1 | |
| 15.02045972 | 1 | |
| 15.02213553 | 1 |
| Value | Count | Frequency (%) |
| 399.9984328 | 1 | |
| 399.9974432 | 1 | |
| 399.9915826 | 1 | |
| 399.9852974 | 1 | |
| 399.9835731 | 1 |
antiguedad
Categorical
Date the customer signed up.
| Distinct | 92237 |
|---|---|
| Distinct (%) | 99.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 91769 ? |
|---|---|
| Unique (%) | 99.0% |
Sample
| 1st row | 11/23/2018 08:48 AM |
|---|---|
| 2nd row | 08/22/2017 03:19 AM |
| 3rd row | 12/27/2001 01:50 PM |
| 4th row | 08/08/2015 10:53 AM |
| 5th row | 11/04/1997 11:43 AM |
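Because antiguedad is stored as text, the report treats it as a categorical variable with very high cardinality. A minimal sketch of parsing it into a timestamp is shown here, assuming every value follows the `MM/DD/YYYY hh:mm AM/PM` layout seen in the sample rows; the function names and the reference date are hypothetical.

```python
from datetime import datetime

# Assumed layout, inferred from the sample rows (e.g. "11/23/2018 08:48 AM").
ANTIGUEDAD_FORMAT = "%m/%d/%Y %I:%M %p"


def parse_antiguedad(text):
    """Parse one antiguedad string into a datetime."""
    return datetime.strptime(text, ANTIGUEDAD_FORMAT)


def tenure_days(text, reference):
    """Whole days between the sign-up date and a chosen reference date."""
    return (reference - parse_antiguedad(text)).days


samples = ["11/23/2018 08:48 AM", "12/27/2001 01:50 PM"]
reference = datetime(2020, 2, 1)  # arbitrary reference date for illustration
for s in samples:
    print(s, "->", parse_antiguedad(s).isoformat(), tenure_days(s, reference))
```

Once parsed, the column can be profiled as a date or converted to a numeric tenure instead of being summarized as free text.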
Common Values
| Value | Count | Frequency (%) |
| 01/07/2020 03:55 PM | 3 | < 0.1% |
| 01/09/2020 02:33 PM | 3 | < 0.1% |
| 01/14/2020 05:08 PM | 3 | < 0.1% |
| 01/25/2020 12:51 PM | 3 | < 0.1% |
| 01/19/2020 04:57 PM | 3 | < 0.1% |
| 01/07/2020 10:37 PM | 3 | < 0.1% |
| 01/15/2020 07:33 AM | 2 | < 0.1% |
| 01/18/2008 12:49 AM | 2 | < 0.1% |
| 01/27/2020 11:04 PM | 2 | < 0.1% |
| 10/14/2003 04:41 AM | 2 | < 0.1% |
| Other values (92227) | 92685 |
Words
| Value | Count | Frequency (%) |
| pm | 46394 | 16.7% |
| am | 46317 | 16.7% |
| 01/05/2020 | 174 | 0.1% |
| 12:37 | 165 | 0.1% |
| 01:44 | 163 | 0.1% |
| 01/03/2020 | 163 | 0.1% |
| 03:54 | 161 | 0.1% |
| 01/10/2020 | 161 | 0.1% |
| 01:08 | 159 | 0.1% |
| 02:00 | 159 | 0.1% |
| Other values (9874) | 184117 |
provincia
Categorical
Province of the customer.
| Distinct | 50 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 22 |
|---|---|
| Median length | 7 |
| Mean length | 7.603596121 |
| Min length | 4 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | La Rioja |
|---|---|
| 2nd row | Vizcaya |
| 3rd row | Albacete |
| 4th row | Lugo |
| 5th row | Huelva |
Common Values
| Value | Count | Frequency (%) |
| Valencia | 1941 | 2.1% |
| Asturias | 1934 | 2.1% |
| Murcia | 1931 | 2.1% |
| Navarra | 1930 | 2.1% |
| Zaragoza | 1927 | 2.1% |
| Málaga | 1924 | 2.1% |
| Alicante | 1895 | 2.0% |
| Orense | 1891 | 2.0% |
| Guipúzcoa | 1886 | 2.0% |
| Zamora | 1879 | 2.0% |
| Other values (40) | 73573 |
Words
| Value | Count | Frequency (%) |
| la | 3683 | 3.4% |
| valencia | 1941 | 1.8% |
| asturias | 1934 | 1.8% |
| murcia | 1931 | 1.8% |
| navarra | 1930 | 1.8% |
| zaragoza | 1927 | 1.8% |
| málaga | 1924 | 1.8% |
| alicante | 1895 | 1.8% |
| orense | 1891 | 1.8% |
| guipúzcoa | 1886 | 1.8% |
| Other values (47) | 86560 |
num_lineas
Categorical
Number of mobile lines contracted.
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 5 |
|---|---|
| 2nd row | 3 |
| 3rd row | 4 |
| 4th row | 4 |
| 5th row | 4 |
Common Values
| Value | Count | Frequency (%) |
| 3 | 29071 | |
| 4 | 25927 | |
| 5 | 22161 | |
| 2 | 12793 | |
| 1 | 2759 | 3.0% |
num_lineas_impago
Categorical
Number of lines with unpaid bills.
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 90097 | |
| 4.0 | 685 | 0.7% |
| 3.0 | 652 | 0.7% |
| 2.0 | 639 | 0.7% |
| 1.0 | 638 | 0.7% |
incidencia
Categorical
SI = the customer has had an incident or complaint.
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NO |
|---|---|
| 2nd row | NO |
| 3rd row | NO |
| 4th row | NO |
| 5th row | NO |
Common Values
| Value | Count | Frequency (%) |
| NO | 90720 | |
| SI | 1991 | 2.1% |
num_llamad_ent
Real number (ℝ≥0)
Number of incoming calls across all the customer's lines.
| Distinct | 251 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 125.1098359 |
| Minimum | 0 |
|---|---|
| Maximum | 250 |
| Zeros | 389 |
| Zeros (%) | 0.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 12 |
| Q1 | 62 |
| median | 125 |
| Q3 | 188 |
| 95-th percentile | 238 |
| Maximum | 250 |
| Range | 250 |
| Interquartile range (IQR) | 126 |
Descriptive statistics
| Standard deviation | 72.42107473 |
|---|---|
| Coefficient of variation (CV) | 0.5788599608 |
| Kurtosis | -1.196924576 |
| Mean | 125.1098359 |
| Median Absolute Deviation (MAD) | 63 |
| Skewness | 0.003331937553 |
| Sum | 11599058 |
| Variance | 5244.812065 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 188 | 443 | 0.5% |
| 243 | 420 | 0.5% |
| 158 | 416 | 0.4% |
| 226 | 409 | 0.4% |
| 62 | 407 | 0.4% |
| 144 | 407 | 0.4% |
| 139 | 406 | 0.4% |
| 159 | 405 | 0.4% |
| 228 | 405 | 0.4% |
| 203 | 405 | 0.4% |
| Other values (241) | 88588 |
| Value | Count | Frequency (%) |
| 0 | 389 | |
| 1 | 375 | |
| 2 | 323 | |
| 3 | 356 | |
| 4 | 349 |
| Value | Count | Frequency (%) |
| 250 | 390 | |
| 249 | 402 | |
| 248 | 391 | |
| 247 | 373 | |
| 246 | 374 |
num_llamad_sal
Real number (ℝ≥0)
Number of outgoing calls across all the customer's lines.
| Distinct | 101 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 49.85895956 |
| Minimum | 0 |
|---|---|
| Maximum | 100 |
| Zeros | 936 |
| Zeros (%) | 1.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 25 |
| median | 50 |
| Q3 | 75 |
| 95-th percentile | 95 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range (IQR) | 50 |
Descriptive statistics
| Standard deviation | 29.20854901 |
|---|---|
| Coefficient of variation (CV) | 0.5858234761 |
| Kurtosis | -1.204273467 |
| Mean | 49.85895956 |
| Median Absolute Deviation (MAD) | 25 |
| Skewness | 0.008586540307 |
| Sum | 4622474 |
| Variance | 853.1393351 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 100 | 998 | 1.1% |
| 25 | 980 | 1.1% |
| 37 | 971 | 1.0% |
| 14 | 970 | 1.0% |
| 38 | 970 | 1.0% |
| 72 | 968 | 1.0% |
| 35 | 968 | 1.0% |
| 89 | 967 | 1.0% |
| 5 | 964 | 1.0% |
| 24 | 962 | 1.0% |
| Other values (91) | 82993 |
| Value | Count | Frequency (%) |
| 0 | 936 | |
| 1 | 913 | |
| 2 | 907 | |
| 3 | 896 | |
| 4 | 951 |
| Value | Count | Frequency (%) |
| 100 | 998 | |
| 99 | 895 | |
| 98 | 909 | |
| 97 | 915 | |
| 96 | 906 |
mb_datos
Real number (ℝ≥0)
MB of data consumed across all the customer's lines.
| Distinct | 24393 |
|---|---|
| Distinct (%) | 26.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12510.1905 |
| Minimum | 0 |
|---|---|
| Maximum | 25000 |
| Zeros | 3 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1250.5 |
| Q1 | 6232.5 |
| median | 12526 |
| Q3 | 18742 |
| 95-th percentile | 23748 |
| Maximum | 25000 |
| Range | 25000 |
| Interquartile range (IQR) | 12509.5 |
Descriptive statistics
| Standard deviation | 7217.671483 |
|---|---|
| Coefficient of variation (CV) | 0.5769433716 |
| Kurtosis | -1.200950925 |
| Mean | 12510.1905 |
| Median Absolute Deviation (MAD) | 6260 |
| Skewness | -0.003898539754 |
| Sum | 1159832271 |
| Variance | 52094781.64 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 13465 | 12 | < 0.1% |
| 22543 | 12 | < 0.1% |
| 18137 | 12 | < 0.1% |
| 23818 | 12 | < 0.1% |
| 10063 | 12 | < 0.1% |
| 6257 | 12 | < 0.1% |
| 12180 | 11 | < 0.1% |
| 4350 | 11 | < 0.1% |
| 8048 | 11 | < 0.1% |
| 19894 | 11 | < 0.1% |
| Other values (24383) | 92595 |
| Value | Count | Frequency (%) |
| 0 | 3 | |
| 1 | 6 | |
| 2 | 4 | |
| 3 | 4 | |
| 4 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 25000 | 6 | |
| 24999 | 4 | |
| 24998 | 1 | < 0.1% |
| 24997 | 3 | |
| 24996 | 6 |
seg_llamad_ent
Real number (ℝ≥0)
Seconds spent on incoming calls.
| Distinct | 19815 |
|---|---|
| Distinct (%) | 21.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9985.382781 |
| Minimum | 0 |
|---|---|
| Maximum | 20000 |
| Zeros | 8 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1014 |
| Q1 | 4960 |
| median | 9998 |
| Q3 | 14981 |
| 95-th percentile | 18998 |
| Maximum | 20000 |
| Range | 20000 |
| Interquartile range (IQR) | 10021 |
Descriptive statistics
| Standard deviation | 5774.903324 |
|---|---|
| Coefficient of variation (CV) | 0.5783356983 |
| Kurtosis | -1.200962746 |
| Mean | 9985.382781 |
| Median Absolute Deviation (MAD) | 5010 |
| Skewness | 0.004930637947 |
| Sum | 925754823 |
| Variance | 33349508.4 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1863 | 16 | < 0.1% |
| 2676 | 16 | < 0.1% |
| 1883 | 14 | < 0.1% |
| 14137 | 14 | < 0.1% |
| 3979 | 14 | < 0.1% |
| 17210 | 14 | < 0.1% |
| 15036 | 13 | < 0.1% |
| 1557 | 13 | < 0.1% |
| 18311 | 13 | < 0.1% |
| 8851 | 13 | < 0.1% |
| Other values (19805) | 92571 |
| Value | Count | Frequency (%) |
| 0 | 8 | |
| 1 | 5 | |
| 2 | 2 | < 0.1% |
| 3 | 4 | |
| 4 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 20000 | 5 | |
| 19999 | 4 | |
| 19998 | 4 | |
| 19997 | 7 | |
| 19996 | 7 |
seg_llamad_sal
Real number (ℝ≥0)
Seconds spent on outgoing calls.
| Distinct | 19798 |
|---|---|
| Distinct (%) | 21.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10030.44396 |
| Minimum | 0 |
|---|---|
| Maximum | 20000 |
| Zeros | 6 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1002 |
| Q1 | 5010 |
| median | 10037 |
| Q3 | 15036 |
| 95-th percentile | 19011 |
| Maximum | 20000 |
| Range | 20000 |
| Interquartile range (IQR) | 10026 |
Descriptive statistics
| Standard deviation | 5786.754197 |
|---|---|
| Coefficient of variation (CV) | 0.5769190496 |
| Kurtosis | -1.203101787 |
| Mean | 10030.44396 |
| Median Absolute Deviation (MAD) | 5016 |
| Skewness | -0.00466271821 |
| Sum | 929932490 |
| Variance | 33486524.14 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 19562 | 15 | < 0.1% |
| 8315 | 15 | < 0.1% |
| 75 | 14 | < 0.1% |
| 14760 | 14 | < 0.1% |
| 13760 | 13 | < 0.1% |
| 4828 | 13 | < 0.1% |
| 9090 | 13 | < 0.1% |
| 16371 | 13 | < 0.1% |
| 13878 | 13 | < 0.1% |
| 16149 | 13 | < 0.1% |
| Other values (19788) | 92575 |
| Value | Count | Frequency (%) |
| 0 | 6 | |
| 1 | 5 | |
| 2 | 4 | |
| 3 | 4 | |
| 4 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 20000 | 4 | |
| 19999 | 7 | |
| 19998 | 6 | |
| 19997 | 3 | < 0.1% |
| 19996 | 8 |
conexion
Categorical
Type of the customer's internet connection.
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 1.4 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 4 |
| Mean length | 4.497459794 |
| Min length | 4 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | FIBRA |
|---|---|
| 2nd row | FIBRA |
| 3rd row | ADSL |
| 4th row | FIBRA |
| 5th row | FIBRA |
Common Values
| Value | Count | Frequency (%) |
| ADSL | 46590 | |
| FIBRA | 46119 | |
| (Missing) | 2 | < 0.1% |
vel_conexion
Categorical
Internet connection speed.
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3 |
| Missing (%) | < 0.1% |
| Memory size | 1.4 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 4 |
| Mean length | 4.398584804 |
| Min length | 4 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 50MB |
|---|---|
| 2nd row | 600MB |
| 3rd row | 35MB |
| 4th row | 200MB |
| 5th row | 200MB |
Common Values
| Value | Count | Frequency (%) |
| 200MB | 9342 | |
| 600MB | 9299 | |
| 300MB | 9212 | |
| 50MB | 9167 | |
| 100MB | 9099 | |
| 20MB | 7882 | |
| 25MB | 7840 | |
| 10MB | 7807 | |
| 30MB | 7761 | |
| 35MB | 7672 |
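The speeds are stored as labels such as `200MB`, so the report treats vel_conexion as categorical. A small sketch of turning each label into a number is given here, assuming every non-missing value is an integer followed by `MB`, as in the table above; `parse_speed` is a hypothetical helper, not something the report provides.

```python
import re

# Assumed label layout: an integer immediately followed by "MB" (e.g. "600MB").
_SPEED_RE = re.compile(r"^(\d+)MB$")


def parse_speed(label):
    """Return the numeric part of a speed label, or None for a missing value."""
    if label is None:
        return None
    match = _SPEED_RE.match(label.strip())
    if match is None:
        raise ValueError(f"unexpected speed label: {label!r}")
    return int(match.group(1))


labels = ["200MB", "600MB", "300MB", "50MB", "100MB",
          "20MB", "25MB", "10MB", "30MB", "35MB"]
print(sorted(parse_speed(label) for label in labels))
```

Passing missing values through as `None` keeps the 3 missing cells of this variable visible instead of silently coercing them.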
TV
Categorical
Type of TV package contracted by the customer.
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 9 |
| Mean length | 9.560300288 |
| Min length | 8 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | tv-futbol |
|---|---|
| 2nd row | tv-futbol |
| 3rd row | tv-futbol |
| 4th row | tv-familiar |
| 5th row | tv-futbol |
Common Values
| Value | Count | Frequency (%) |
| tv-futbol | 46191 | |
| tv-familiar | 32822 | |
| tv-total | 13698 | 14.8% |
financiacion
Categorical
SI = the customer is financing a handset.
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NO |
|---|---|
| 2nd row | NO |
| 3rd row | NO |
| 4th row | NO |
| 5th row | NO |
Common Values
| Value | Count | Frequency (%) |
| NO | 86045 | |
| SI | 6666 | 7.2% |
imp_financ
Real number (ℝ≥0)
Monthly amount the customer pays for financed handsets.
| Distinct | 6667 |
|---|---|
| Distinct (%) | 7.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.601432833 |
| Minimum | 0 |
|---|---|
| Maximum | 39.99195402 |
| Zeros | 86045 |
| Zeros (%) | 92.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 15.3968231 |
| Maximum | 39.99195402 |
| Range | 39.99195402 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 6.366160907 |
|---|---|
| Coefficient of variation (CV) | 3.975290612 |
| Kurtosis | 17.59324097 |
| Mean | 1.601432833 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.244524075 |
| Sum | 148470.4394 |
| Variance | 40.5280047 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 86045 | |
| 10.24354491 | 1 | < 0.1% |
| 18.59953244 | 1 | < 0.1% |
| 26.95092346 | 1 | < 0.1% |
| 16.98623804 | 1 | < 0.1% |
| 33.63331701 | 1 | < 0.1% |
| 5.816686074 | 1 | < 0.1% |
| 15.95684633 | 1 | < 0.1% |
| 27.30485137 | 1 | < 0.1% |
| 13.80874237 | 1 | < 0.1% |
| Other values (6657) | 6657 | 7.2% |
| Value | Count | Frequency (%) |
| 0 | 86045 | |
| 5.009998664 | 1 | < 0.1% |
| 5.013309309 | 1 | < 0.1% |
| 5.021417588 | 1 | < 0.1% |
| 5.025074875 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 39.99195402 | 1 | |
| 39.99012758 | 1 | |
| 39.98897814 | 1 | |
| 39.98756476 | 1 | |
| 39.97837601 | 1 |
descuentos
Categorical
SI = the customer has an active discount.
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NO |
|---|---|
| 2nd row | SI |
| 3rd row | SI |
| 4th row | NO |
| 5th row | NO |
Common Values
| Value | Count | Frequency (%) |
| NO | 72673 | |
| SI | 20038 | 21.6% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
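The calculation described above can be sketched in plain Python. This is an illustration only, not the code the report uses internally, and the data is a made-up toy sample; `pearson_r` is a name chosen here.

```python
import math


def pearson_r(xs, ys):
    """Covariance of X and Y divided by the product of their standard deviations."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equally long samples with at least 2 points")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # The 1/n factors of the covariance and the standard deviations cancel,
    # so plain sums of products and squares are enough.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    ss_x = sum((x - mean_x) ** 2 for x in xs)
    ss_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(ss_x * ss_y)


xs = [1, 2, 3, 4, 5]   # toy data, not taken from the dataset
ys = [2, 1, 4, 3, 5]
print(pearson_r(xs, ys))
# Shifting and rescaling either variable (by a positive factor) leaves r unchanged.
print(pearson_r([10 * x + 3 for x in xs], [0.5 * y - 7 for y in ys]))
```

The function raises `ZeroDivisionError` when either variable is constant, since the correlation is undefined in that case.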
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
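A minimal sketch of that recipe follows: rank both variables, then apply the Pearson formula to the ranks. Giving tied values the average of their positions is an assumption made here (it is a common convention); the function names and the toy data are illustrative.

```python
import math


def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson_r(xs, ys):
    """Pearson's r, used here on the rank variables."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    ss_x = sum((x - mean_x) ** 2 for x in xs)
    ss_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(ss_x * ss_y)


def spearman_rho(xs, ys):
    """Pearson correlation of the rank variables of X and Y."""
    return pearson_r(average_ranks(xs), average_ranks(ys))


xs = [1, 2, 3, 4, 5]        # toy data
ys = [1, 4, 9, 16, 25]      # monotonic in xs, but not linear
print(spearman_rho(xs, ys), pearson_r(xs, ys))
```

On this toy sample ρ reaches 1 because the relationship is perfectly monotonic, while Pearson's r on the raw values stays below 1.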
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
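The pair-counting definition above can be written out directly. The sketch below implements exactly that form (often called tau-a) on toy data; it is quadratic in the number of observations and makes no adjustment for ties, which library implementations may handle differently.

```python
from itertools import combinations


def kendall_tau_a(xs, ys):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    points = list(zip(xs, ys))
    n = len(points)
    if n < 2:
        raise ValueError("need at least 2 observations")
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(points, 2):
        sign = (x1 - x2) * (y1 - y2)
        if sign > 0:
            concordant += 1   # both variables move in the same direction
        elif sign < 0:
            discordant += 1   # the variables move in opposite directions
        # sign == 0 means a tie in x or y; the pair counts toward neither
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs


print(kendall_tau_a([1, 2, 3], [1, 3, 2]))  # 2 concordant, 1 discordant, 3 pairs
```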
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation is available for it.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure proposed by Bergsma in 2013.
First rows
| | edad | facturacion | antiguedad | provincia | num_lineas | num_lineas_impago | incidencia | num_llamad_ent | num_llamad_sal | mb_datos | seg_llamad_ent | seg_llamad_sal | conexion | vel_conexion | TV | financiacion | imp_financ | descuentos |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 63 | 216.028109 | 11/23/2018 08:48 AM | La Rioja | 5 | 0.0 | NO | 95 | 19 | 6525 | 7634 | 18520 | FIBRA | 50MB | tv-futbol | NO | 0.000000 | NO |
| 1 | 84 | 255.830842 | 08/22/2017 03:19 AM | Vizcaya | 3 | 0.0 | NO | 44 | 36 | 14471 | 14541 | 8016 | FIBRA | 600MB | tv-futbol | NO | 0.000000 | SI |
| 2 | 66 | 135.768153 | 12/27/2001 01:50 PM | Albacete | 4 | 0.0 | NO | 94 | 27 | 1428 | 5248 | 7106 | ADSL | 35MB | tv-futbol | NO | 0.000000 | SI |
| 3 | 69 | 255.658527 | 08/08/2015 10:53 AM | Lugo | 4 | 0.0 | NO | 186 | 20 | 20083 | 7372 | 5052 | FIBRA | 200MB | tv-familiar | NO | 0.000000 | NO |
| 4 | 51 | 99.348645 | 11/04/1997 11:43 AM | Huelva | 4 | 0.0 | NO | 37 | 32 | 19078 | 5009 | 8686 | FIBRA | 200MB | tv-futbol | NO | 0.000000 | NO |
| 5 | 55 | 88.062883 | 06/14/1996 01:44 AM | Lérida | 4 | 0.0 | NO | 78 | 96 | 3032 | 5118 | 11695 | ADSL | 25MB | tv-futbol | SI | 31.553269 | NO |
| 6 | 21 | 73.076377 | 07/02/2004 12:35 PM | La Coruña | 4 | 0.0 | NO | 183 | 9 | 16442 | 7771 | 13478 | ADSL | 30MB | tv-futbol | NO | 0.000000 | NO |
| 7 | 30 | 395.481514 | 03/26/2018 10:22 PM | Alicante | 3 | 0.0 | NO | 152 | 16 | 17184 | 10493 | 11638 | ADSL | 35MB | tv-total | NO | 0.000000 | NO |
| 8 | 64 | 391.692196 | 09/15/2004 01:49 AM | Córdoba | 5 | 0.0 | NO | 97 | 43 | 10961 | 10288 | 13798 | ADSL | 10MB | tv-futbol | NO | 0.000000 | SI |
| 9 | 80 | 199.380443 | 07/26/2011 01:33 AM | Las Palmas | 2 | 0.0 | NO | 187 | 41 | 14428 | 9837 | 14834 | FIBRA | 100MB | tv-total | NO | 0.000000 | SI |
Last rows
| | edad | facturacion | antiguedad | provincia | num_lineas | num_lineas_impago | incidencia | num_llamad_ent | num_llamad_sal | mb_datos | seg_llamad_ent | seg_llamad_sal | conexion | vel_conexion | TV | financiacion | imp_financ | descuentos |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 92701 | 55 | 316.728460 | 05/01/2011 11:41 PM | Asturias | 5 | 0.0 | NO | 143 | 34 | 4551 | 18534 | 9 | ADSL | 10MB | tv-total | SI | 28.355596 | NO |
| 92702 | 75 | 32.297445 | 12/02/2008 03:40 AM | Tarragona | 2 | 0.0 | NO | 96 | 14 | 296 | 2761 | 9241 | FIBRA | 200MB | tv-futbol | NO | 0.000000 | NO |
| 92703 | 58 | 375.658420 | 06/09/2016 09:39 PM | Santa Cruz de Tenerife | 5 | 0.0 | NO | 141 | 11 | 6740 | 4596 | 11926 | FIBRA | 100MB | tv-total | NO | 0.000000 | NO |
| 92704 | 32 | 15.570680 | 01/18/2013 12:54 PM | Tarragona | 2 | 0.0 | NO | 173 | 58 | 3128 | 18340 | 2873 | FIBRA | 200MB | tv-futbol | NO | 0.000000 | SI |
| 92705 | 65 | 173.741667 | 03/05/2019 12:00 AM | Murcia | 5 | 0.0 | NO | 42 | 17 | 3943 | 10085 | 14566 | ADSL | 35MB | tv-familiar | SI | 23.138779 | NO |
| 92706 | 36 | 215.890326 | 04/09/2013 01:33 PM | Guadalajara | 3 | 0.0 | NO | 217 | 96 | 9059 | 7735 | 8823 | ADSL | 30MB | tv-futbol | NO | 0.000000 | NO |
| 92707 | 68 | 285.890750 | 08/08/2003 11:57 PM | Asturias | 5 | 0.0 | NO | 168 | 99 | 9303 | 4798 | 3996 | FIBRA | 200MB | tv-futbol | SI | 14.616422 | NO |
| 92708 | 20 | 383.167610 | 03/27/2013 08:07 PM | Álava | 4 | 0.0 | NO | 188 | 71 | 19018 | 1237 | 16720 | ADSL | 20MB | tv-futbol | NO | 0.000000 | NO |
| 92709 | 53 | 53.301395 | 01/18/2020 02:30 AM | Sevilla | 2 | 0.0 | NO | 138 | 40 | 20264 | 10552 | 17637 | FIBRA | 50MB | tv-futbol | NO | 0.000000 | NO |
| 92710 | 18 | 57.158927 | 10/22/2009 07:17 PM | Las Palmas | 4 | 0.0 | NO | 217 | 65 | 21772 | 14141 | 927 | ADSL | 25MB | tv-familiar | NO | 0.000000 | SI |